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In the paper a new fitting function is suggested, which can essentially increase the 
existing instrumentation for fitting of asymmetric peaks with the only maximum. 



I. INTRODUCTION 

During experimental data processing often there is a task to fit arbitrary distribution with 
the only peak to some function with a set of free parameters. This can be necessary for more 
precise determination of peak position, or full width at half maximum (devise resolution), 
or for the signal form approximation using Monte Carlo events for subsequent analysis of 
data distribution and so on. Available set of fitting functions does not always matches the 
experimental requirements, especially for high statistics (it is difficult to achieve suitable 
confidence level). 

In practice most often the sum of the three Gaussian distributions is used to fit such 
histograms. In most cases this fit provides suitable confidence level. At least one can add 
more and more Gaussians until confidence level is admissible. Disadvantage of this method 
is that every Gaussian used is symmetric, so in principle one can not used sum of Gaussians 
with common center to fit asymmetric distributions, and if all Gaussians have different 
centers, then it is difficult to provide the only maximum of fitting function and monotony 
to the right and left of maximum. 

Another often used function is a spline of two halves of different Gaussians. This function 
is good in case that the function "tails" can be approximated with Gaussian. However for 
high statistics the major experimental distributions far of peak have exponential or power- 
of-x dependence, which do not match Gaussian. 

Logarithmic Gaussian distribution is also often used (at SLAG it is called "Novosibirsk" 
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where Xp is a peak location (function maximum), h is full width at half maximum (FWHM), 
A is asymmetry parameter, A is normalization factor. 
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As is seen from (l2|), for A > the boundary coordinate Xm < Xp and all a; > x^. If A < 0, 
then Xm > Xp and x < Xm- Using these notations one can rewrite this function, excluding 
the inconvenient variable Xm- 

^ (5) 



F^{x; Xp, h,\) = Jl-^_-\^ exp -\ - ^ In^ (l + 2zX^^) 



Function is equal to for all x, for which the logarithm argument is negative. Possible 
values of parameters: > 0, A is arbitrary, Xp is arbitrary. For A ~ the formulae have an 
ambiguity of the type 0/0, so some parameters should be expanded to the Tailor series. 
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For A = the function converts to 
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that is Gaussian distribution with root mean square 

\l < (x — x„V > = — ,^ ^ — - — . 
V ^ 2v/2h^ 2.3548 

This function is convenient for fitting the distributions with abrupt spectrum end. How- 
ever many experimental distributions are more smooth, and suggested in this paper function 
can be more successful. 



II. CONVOLUTION OF GAUSSIAN AND EXPONENTIAL DISTRIBUTIONS 



It is suggested to build the fitting function on the base of convolution of Gaussian and 
exponential distributions, which can be easily derived: 
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Integral of this function over all x equals 1. Such a function was first used by the author 
in 2004 at SLAG (BaBar Note # 582) for fitting the deposited energy distributions in 
calorimeter with the aim of peak position and resolution determination for the algorithms 
of absolute photon energy calibration, and despite some technical difficulties, this function 
proved to be enough convenient for fitting such distributions, especially for high statistics. 
Technical difficulties appear when the argument of ERF function is big 
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If the argument of ERF function in formula (fTTl ) is denoted as z 
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then the formula f lTTl) looks like: 
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\i z +00, then exponential index also goes to infinity, and so the ambiguity of the type 
■ 00 arises. Because of finite accuracy of computer calculations, this difficulty appears 
rather fast, just for moderate values of z ~ 10. 
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To avoid this problem one can use the asymptotic expansion 
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Substituting this expansion to (fT4ll . we obtain 
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Here for big z values no ambiguities appear, function goes to 0. This very expansion allows 
to find the limit for A — * 0. Indeed, 
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Function Fbi plots for several sets of parameters are presented in Fig.lH 
Sometimes the integral distribution can be useful: 
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Usage of Fbi to fit the distributions would be more convenient, if the free parameter is 
Xm — location of function maximum, instead of Xg. Equation for the search for Xm looks 
rather complicated: 
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Figure 1: Function Fbi{x; Xg,ag, Ai) plots for cr^ = 1, Xg = and several values of Ai 
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For p 1 Zm ^ +00, and here we also can derive approximate solution: 
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Figure 2: Function Fp{fi) plot (points) and approximating cubic spline 



Let us return to the equation (i23l l. which should be solved in order to find Fbi maximum 
location. Let us transform the interval of p variable to the interval (0, 1): 
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At the ends of the interval we know the solution: 
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Let us look for approximating function in the form: 
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Function Fp{fi) plot and the approximating cubic spline with 5 knots are presented in Fig. El 
Root mean square deviation equals 4.1 ■ 10""^, maximum error of interpolation 2.2 ■ 10~^ is 
achieved at = 1.0 ■ 10~^. Spline coefficients are cited in TablelB Now we can calculate the 
shift of peak position vs Gaussian center: 
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The function AXmg can be easily implemented in any programming language, using the 
above formulae. 
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Table I: Spline of deficiency 2 coefficients for approximation of Fp{fj,) function. 



Knot coordinate 





0.25 


0.5 


0.75 


1 


Spline value 


1.00298 


0.97574 


1.05593 


1.03058 


0.99898 


Spline derivative 


-0.29472 


0.19545 


0.27768 


-0.35485 


0.33206 



III. BUILD OF FITTING FUNCTION ON THE BASE OF Fbi 

In some cases the function Fbi itself can be suitable for fitting. However more often this 
function is not enough flexible to provide satisfactory confidence level with experimental 
distribution. 

One could try to use the function which is the convolution of the three distributions: 
Gaussian and two different exponential. Such a function could be useful if the distribution 
"tails" both to the right and left from the peak do not match Gaussian distribution. 

However in this case and in other difficult ones the distributions are fitted more success- 
fully to the sum of different Fbi functions with the common parameter — peak position Xm- 
For the sum of two functions one can use the following expression: 

^2Bi(a;;a;m,^,cri, Ai,cr2, A2) = cos^^-Fbi {x;Xm - AX„<,(o-i, Ai), (Xi, Ai) + 

(34) 

+ sin^^ ■ Fbi {x; Xm - AXrag{cr2, X2),cr2, A2) 

Sample of using such a function is presented in Fig. [3] (one more parameter is added — 
common factor A). 

In principle for complicated cases one can use the sum of more Fbi functions. However 
for so many free parameters the likelihood function minimization can be unstable, and one 
need to help MINUIT program. The simplest and enough effective trick is optimization of 
parameters in turn, initially fixed at some reasonable values. 

IV. CONCLUSION 

For fitting the smooth distributions with one peak a function is suggested, which is the 
convolution of Gaussian and exponential distributions. 

In cases when this function is not enough fiexible to provide satisfactory confidence level, 
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Figure 3: Distributions vs invariant masses of photon pairs. Left — decay photons of ry, right — 
decay of 7r°. Fit function FIT2FB1. 



one can use the sum of several functions with different parameters, but the same peak 
position. 



